UPS delivers optimal phase diagram in high-dimensional variable selection
نویسندگان
چکیده
منابع مشابه
High Dimensional Variable Selection.
This paper explores the following question: what kind of statistical guarantees can be given when doing variable selection in high dimensional models? In particular, we look at the error rates and power of some multi-stage regression methods. In the first stage we fit a set of candidate models. In the second stage we select one model by cross-validation. In the third stage we use hypothesis tes...
متن کاملPhase Diagram for Variable Selection and Non-optimal Regions for L and L Penalization Methods
Consider a linear model Y = Xβ + z, z ∼ N(0, In), where the rows of X are iid samples from N(0,Σ), with Σ being a p by p matrix. It is believed that only a small fraction of the coordinates of β is nonzero, and we are interested in identifying such coordinates. We adopt an asymptotic framework where both p are n are large. In certain ranges, we find that the above problem reduces to a normal me...
متن کاملFeature Selection by Higher Criticism Thresholding: Optimal Phase Diagram
We consider two-class linear classification in a high-dimensional, low-sample size setting. Only a small fraction of the features are useful, the useful features are unknown to us, and each useful feature contributes weakly to the classification decision – this setting was called the rare/weak model (RW Model) in [11]. We select features by thresholding feature z-scores. The threshold is set by...
متن کاملHigh Dimensional Variable Selection with Error Control
Background. The iterative sure independence screening (ISIS) is a popular method in selecting important variables while maintaining most of the informative variables relevant to the outcome in high throughput data. However, it not only is computationally intensive but also may cause high false discovery rate (FDR). We propose to use the FDR as a screening method to reduce the high dimension to ...
متن کاملVariable Selection for High Dimensional Multivariate Outcomes.
We consider variable selection for high-dimensional multivariate regression using penalized likelihoods when the number of outcomes and the number of covariates might be large. To account for within-subject correlation, we consider variable selection when a working precision matrix is used and when the precision matrix is jointly estimated using a two-stage procedure. We show that under suitabl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Annals of Statistics
سال: 2012
ISSN: 0090-5364
DOI: 10.1214/11-aos947